CDS
Accession Number | TCMCG075C30110 |
gbkey | CDS |
Protein Id | XP_017985026.1 |
Location | join(18064543..18064729,18065244..18065326,18065423..18065542,18066929..18067005,18067434..18067497,18067809..18067879,18068029..18068106,18068200..18068245,18069557..18069737,18069820..18069872,18070029..18070102,18070233..18070350,18071059..18071146,18072115..18072230,18073520..18073647,18073748..18073827,18074541..18074782,18074857..18075021,18075150..18075314,18075399..18075473,18076075..18076178,18076260..18076335,18076785..18076913,18077023..18077163,18077733..18077945,18078345..18078599,18079151..18079564) |
Gene | LOC18587096 |
GeneID | 18587096 |
Organism | Theobroma cacao |
Protein
Length | 1180aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018129537.1 |
Definition | PREDICTED: DNA-directed RNA polymerase I subunit 2 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGAAAGGCAGAGGAGGAAGAGGCCGGGGCCGTTTTCGGACGCGGAAGAGCTAGAAGAACTAAAGGAGCTATTCAAACACCATATCGAGTCGTTCGATTACATGATCGACGAAGGCTTAGACCTTATGCTTAAGCGCGTCAAGCCTGTCCAAATCTTCGATTCTTCCTCCAATAAAACCCTTAGAATCTGGCTGGATCATCCGGAGGTCTATCCACCGCAGAAGGACCGGTCATCAAAGACATCAGCAGGAGCTTTGTATCCATTTGAATGTAGACAAGCAAAAATTTCTTACACAGGGAGTTTCCATATAGATGTTTGCTTTCAGTGGGATGGTGGAGTTGTTGTAAGAGAAAAGTTAAATTTTGGAGAGTTTCCTATAATGTTAAAGTCAAAACGTTGTTACTTGCGAGAAGCTGATCCCAGAAAACTGGTTGCTTGCAAAGAAGAGTCGCGAGAAATGGGTGGTTACTTCATTCTGAATGGGCTTGAGAGAGTAGTTCGACTTTTGATATTGCCGAAACGGAATTATCCAATGAGTTTGGTACGTAATTCATTTCGTGATCGTCGGGAAGGGTATACTGATAAAGCAGTTGTCATAAGGTGTGTGAGAGGAGATCTATCATCAGTGACAGTTAAGTTATATTATCTTCATAATGGAAGTGCAAGACTTGGATTCTGGGTACAGGGGAGGGAATACATGCTTCCTGTGGGCATTATACTAAAGGCTCTGATTGACACTAATGATCGTGAAATTTATACAAAATTGACATGCTGCTATAATGAGAAAAATGGTGAAGGAAAGGGGGCTGTTGGCACTCAACTCGTTGGTGAAAGGGCCAAGATTATTCTTGATGAAGTTCAGCACTTGGCTCTTTTCACTCAGGAGCAGTGTCTACAGCATATTGGGGAACACTTCCAACCTATTATGGAGGGAATGGAAAGTGAGAGTTATTCTACTGTTGCTGATGTGGTGCTGAGGAATTACATATTTGTTCACTTGGATGACAACAATGACAAGTTTAATCTGCTCATCTTTATGGTGCAGAAACTTTTCTCACTTATAGATCAAACTTCTGCACCAGATAACTCAGATTCCTTGCAAAATCAGGAAGTTTTACTCCCTGGTCATCTCATTACCATTTACCTTAAGGAGAAACTGGAAGATTGGTTGCGCAAAGGAAAGAAGCTTATTGAAGATGCGATTAATAACAAAAGCAAAAATTTTGATTTCTGCAGCATGAAAGATGTCAAGAAGGTGATGGAAAAAAATCGTCCAACGCAAGTCAGTGCAGCAATTGAGAATTTGTTGAAAACTGGAAGATTGATAACACAGACAGGTCTAGATTTACAGCAGAGGGCTGGTTTCACGGTTCAGGCAGAGAGGCTTAACTATCATCGATTTCTTTCGTTTTTTCGGTGTGTTCATCGTGGGGCTTCATTCGCCGGACTTCGTACGACCAGTGTTAGGAAGTTGTTACCTGAATCTTGGGGTTTTCTTTGCCCCGTGCACACTCCTGATGGGGAGCCTTGTGGGTTGTTGAACCACATGACATGTACTTGTCGAATTACATCCTACTACAATTCGCCAGGAAATATTAGAGATTTTTTTAAAATAAGAATGTCTATCCTTGATGTTCTAGTTGGGGTTGGAATGACAACTTCTTGGCCAAAAGTTGATCATGCTGGACCTCCTCAAGTTCTTCCTGTTCTTTTAGATGGTCGTGTTGTGGGCTCTTTACCTTCTGGTGAAGCTGAAAAAGTTGTTGCTCATTTGCGGAGATTGAAATTAGCAGCTGCTTCAGTGATTCCTGATGACTTGGAAGTCGGCTATGTTCCTTTGAGCTTGGGTGGCACGTATCCTGGTTTGTACCTGTTTACTTCTCCATCTAGATTTGTTCGGCCTGTCAGAAATATTTCTATCCCTTCGGCAGATGGGAAGGATATTGAACTTATTGGGCCATTTGAACAGGTTTTCATGGAAATCAGATGTCCAGATGGTGGGAATGGAGGAAGAAGTAATATTTTTCCCGCAACTCATGAAGAAATTAGTCCAACTGCAATGCTTAGTGTGGTTGCTAATCTTACGCCTTGGTCAGACCATAATCAAAGTCCACGGAATATGTATCAGTGCCAGATGGCAAAACAAACAATGGCTTTTTCTTTACAAGCAATTAATGCACGTGCAGATCAAAAGTTGTATCATCTTCAGACTCCTCAAACTCCAATTGTGCGCACAAAAACATATACAAAGTACTGCATGGATGAATATCCTTCAGGAACGAATGCAATAGTAGCTGTGCTGGCATATACAGGGTATGATATGGAGGATGCCATGATTTTGAATAAGTCATCTGTGGAACGTGGGATGTGTCATGGACAAATATACCAGACGGAAACTATTGACTTGGGTGATGATAAGAGCAAGTCAGATCGAGGTCAAAGAATTTTTAAAAGAGAACATTCAGATAGGTCAATATCTTCTTGTCTTGATTCAGATGGACTTCCACATGTTGGTCAGGTGATACGCCCAAATGAACCTTATTGTAGCACCATTAATCAGGTGACAAATTCAAAGAGACTCTACAATCACAAGGGTTCAGAAACTGTTATTGTTGACTATGTTGCAGTTGATACCAAAAGCAAGAAGCATCTTCAGAAGGCTAATATTCGCTTTCGACATCCGAGAAACCCTGTCATTGGTGATAAATTTAGCAGTAGACATGGGCAGAAAGGTGTTTGCTCTCAGTTGTGGCCAGATATTGATATGCCATTCTCAGGAGTTACAGGAATGCGCCCTGATCTTATAATCAATCCTCATGCATTTCCCTCAAGGATGACAATTGCAATGCTTTTGGAATCTGTTGCTGCTAAGGGAGGAAGCTTACATGGGAAATTTGTGGATGCAACACCATTTTCTGATTCAGTGAAGGAAGCTAAAGGAAAGACTGAGACAGAGTCTGAGTCTCTTGTTGATGAACTTGGTTCCATGTTAAGAGCTCGTGGGTTTAACTACCATGGAGTAGAGGTATTATACAGTGGGGTCTATGGAACAGAACTAACATGTGAGATATTTATTGGCCCTGTTTATTACCAGCGGCTTAGACACATGGTTTCTGACAAATATCAGGTTCGTGCCACTGGACAAGTCGACCAGATTACACGACAGCCTATCAAAGGAAGAAAGCGGGGTGGAGGTATACGTTTTGGGGAAATGGAACGAGATGCTATGCTTGCTCATGGGGCTGCGTATCTGTTGCATGATAGGCTCCATACATGTTCTGATTATCACATTGCTGACGTCTGCTCTCTGTGTGGAAGCATCCTTACGACGTCCATTGTCCAGCCACCAAAGCGAGTAGTTCGAGAGATTGGTGGGTTGCCTCCTGCAAGGGCTCCAAAGAAGGTTACATGTCATGCATGCCAGACAAGCAAAGGGATGGAGACCGTCGCAATGCCTTACGTTTTTAGATATTTGGCTGCTGAGTTGGCAGCTATGAACATAACAATGACCATACAGCTTAATAGTGGAGCTGGGGCTTGA |
Protein: MERQRRKRPGPFSDAEELEELKELFKHHIESFDYMIDEGLDLMLKRVKPVQIFDSSSNKTLRIWLDHPEVYPPQKDRSSKTSAGALYPFECRQAKISYTGSFHIDVCFQWDGGVVVREKLNFGEFPIMLKSKRCYLREADPRKLVACKEESREMGGYFILNGLERVVRLLILPKRNYPMSLVRNSFRDRREGYTDKAVVIRCVRGDLSSVTVKLYYLHNGSARLGFWVQGREYMLPVGIILKALIDTNDREIYTKLTCCYNEKNGEGKGAVGTQLVGERAKIILDEVQHLALFTQEQCLQHIGEHFQPIMEGMESESYSTVADVVLRNYIFVHLDDNNDKFNLLIFMVQKLFSLIDQTSAPDNSDSLQNQEVLLPGHLITIYLKEKLEDWLRKGKKLIEDAINNKSKNFDFCSMKDVKKVMEKNRPTQVSAAIENLLKTGRLITQTGLDLQQRAGFTVQAERLNYHRFLSFFRCVHRGASFAGLRTTSVRKLLPESWGFLCPVHTPDGEPCGLLNHMTCTCRITSYYNSPGNIRDFFKIRMSILDVLVGVGMTTSWPKVDHAGPPQVLPVLLDGRVVGSLPSGEAEKVVAHLRRLKLAAASVIPDDLEVGYVPLSLGGTYPGLYLFTSPSRFVRPVRNISIPSADGKDIELIGPFEQVFMEIRCPDGGNGGRSNIFPATHEEISPTAMLSVVANLTPWSDHNQSPRNMYQCQMAKQTMAFSLQAINARADQKLYHLQTPQTPIVRTKTYTKYCMDEYPSGTNAIVAVLAYTGYDMEDAMILNKSSVERGMCHGQIYQTETIDLGDDKSKSDRGQRIFKREHSDRSISSCLDSDGLPHVGQVIRPNEPYCSTINQVTNSKRLYNHKGSETVIVDYVAVDTKSKKHLQKANIRFRHPRNPVIGDKFSSRHGQKGVCSQLWPDIDMPFSGVTGMRPDLIINPHAFPSRMTIAMLLESVAAKGGSLHGKFVDATPFSDSVKEAKGKTETESESLVDELGSMLRARGFNYHGVEVLYSGVYGTELTCEIFIGPVYYQRLRHMVSDKYQVRATGQVDQITRQPIKGRKRGGGIRFGEMERDAMLAHGAAYLLHDRLHTCSDYHIADVCSLCGSILTTSIVQPPKRVVREIGGLPPARAPKKVTCHACQTSKGMETVAMPYVFRYLAAELAAMNITMTIQLNSGAGA |